Search CORE

23 research outputs found

Some Requests for Machine Learning Research from the East African Tech Scene

Author: Cvitkovic Milan
Publication venue
Publication date: 07/11/2018
Field of study

Based on 46 in-depth interviews with scientists, engineers, and CEOs, this document presents a list of concrete machine research problems, progress on which would directly benefit tech ventures in East Africa.Comment: Presented at NIPS 2018 Workshop on Machine Learning for the Developing Worl

arXiv.org e-Print Archive

Crossref

Caltech Authors

Open Vocabulary Learning on Source Code with a Graph-Structured Cache

Author: Anandkumar Anima
Cvitkovic Milan
Singh Badal
Publication venue
Publication date: 19/05/2019
Field of study

Machine learning models that take computer program source code as input typically use Natural Language Processing (NLP) techniques. However, a major challenge is that code is written using an open, rapidly changing vocabulary due to, e.g., the coinage of new variable and method names. Reasoning over such a vocabulary is not something for which most NLP methods are designed. We introduce a Graph-Structured Cache to address this problem; this cache contains a node for each new word the model encounters with edges connecting each word to its occurrences in the code. We find that combining this graph-structured cache strategy with recent Graph-Neural-Network-based models for supervised learning on code improves the models' performance on a code completion task and a variable naming task --- with over

100\%

relative improvement on the latter --- at the cost of a moderate increase in computation time.Comment: Published in the International Conference on Machine Learning (ICML 2019), 13 page

arXiv.org e-Print Archive

Caltech Authors

A General Method for Amortizing Variational Filtering

Author: Cvitkovic Milan
Marino Joseph
Yue Yisong
Publication venue: Curran Associates, Inc.
Publication date: 13/11/2018
Field of study

We introduce the variational filtering EM algorithm, a simple, general-purpose method for performing variational inference in dynamical latent variable models using information from only past and present variables, i.e. filtering. The algorithm is derived from the variational objective in the filtering setting and consists of an optimization procedure at each time step. By performing each inference optimization procedure with an iterative amortized inference model, we obtain a computationally efficient implementation of the algorithm, which we call amortized variational filtering. We present experiments demonstrating that this general-purpose method improves performance across several deep dynamical latent variable models